PAPILLOMA VIRUS CAPSOMERE VACCINE
FORMULATIONS AND METHODS OF USE

FIELD OF THE INVENTION
The present invention relates to vaccine formulations
comprising papilloma virus proteins, either as fusion proteins, truncated
proteins, or truncated fusion proteins. The invention further embraces
methods for producing capsomeres of the formulations, as well as
prophylactic and therapeutic methods for their use.

BACKGROUND 

Infections with certain high-risk strains of genital papilloma
viruses in humans (HPV), for example HPV 16, 18, or 45, are
believed to be the main risk factor for the formation of malignant tumors of
the anogenital tract. Of the possible malignancies, cervical carcinoma is by
far the most frequent: according to an estimate by the World Health
Organization (WHO), almost 500,000 new cases of the disease occur
annually. Because of the frequency with which this pathology occurs, the
connection between HPV infection and cervical carcinoma has been
extensively examined, leading to numerous generalizations.

For example, precursor lesions of cervical intraepithelial
neoplasia (CIN) are known to be caused by papilloma virus infections
[Crum, New Eng. J. Med. 310:880-883 (1984)]. DNA from the genomes
of certain HPV types, including, for example, strains 16, 18, 33, 35, and
45, has been detected in more than 95% of tumor biopsies from patients
with this disorder, as well as in primary cell lines cultured from the
tumors. Approximately 50 to 70% of the biopsied CIN tumor cells have
been found to include DNA derived only from HPV 16.

The protein products of the HPV 16 and HPV 18 early genes 
E6 and E7 have been detected in cervical carcinoma cell lines as well as in 

by BPV VLPs. The mouse sera were therefore positive for neutralizing antibodies
against the human VLPs, and this differential neutralization was most likely the
result of antibody specificity for epitopes against which the antibodies were raised.

Numerous modifications and variations in the invention as set forth
in the above illustrative examples are expected to occur to those skilled in the art.
Consequently only such limitations as appear in the appended claims should be
placed on the invention.

SEQUENCE LISTING



(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(ii) TITLE OF INVENTION: Papilloma Virus Capsomere Vaccine Formulations
and Methods of Use

(iii) NUMBER OF SEQUENCES: 27

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Marshall, O'Toole, Gerstein, Murray & Borun

(B) STREET: 233 South Wacker Drive, 6300 Sears Tower

(C) CITY: Chicago 

(D) STATE: Illinois 

(E) COUNTRY: United States of America 

(F) ZIP: 60606-6402 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS

(D) SOFTWARE: PatentIn Release #1.0, Version #1.30

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION:

(A) NAME: Williams Jr., Joseph A. 

(B) REGISTRATION NUMBER: 38,659 

(C) REFERENCE/DOCKET NUMBER: 27013/34028

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 312-474-6300 

(B) TELEFAX: 312-474-0448 

(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1518 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..1518 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:

ATG TCT CTT TGG CTG CCT AGT GAG GCC ACT GTC TAC TTG CCT CCT GTC 48
Met Ser Leu Trp Leu Pro Ser Glu Ala Thr Val Tyr Leu Pro Pro Val
1 5 10 15

CCA GTA TCT AAG GTT GTA AGC ACG GAT GAA TAT GTT GCA CGC ACA AAC 96
Pro Val Ser Lys Val Val Ser Thr Asp Glu Tyr Val Ala Arg Thr Asn
20 25 30

ATA TAT TAT CAT GCA GGA ACA TCC AGA CTA CTT GCA GTT GGA CAT CCC 144
Ile Tyr Tyr His Ala Gly Thr Ser Arg Leu Leu Ala Val Gly His Pro
35 40 45

TAT TTT CCT ATT AAA AAA CCT AAC AAT AAC AAA ATA TTA GTT CCT AAA 192
Tyr Phe Pro Ile Lys Lys Pro Asn Asn Asn Lys Ile Leu Val Pro Lys
50 55 60

GTA TCA GGA TTA CAA TAC AGG GTA TTT AGA ATA CAT TTA CCT GAC CCC 240
Val Ser Gly Leu Gln Tyr Arg Val Phe Arg Ile His Leu Pro Asp Pro
65 70 75 80

AAT AAG TTT GGT TTT CCT GAC ACC TCA TTT TAT AAT CCA GAT ACA CAG 288
Asn Lys Phe Gly Phe Pro Asp Thr Ser Phe Tyr Asn Pro Asp Thr Gln
85 90 95

CGG CTG GTT TGG GCC TGT GTA GGT GTT GAG GTA GGT CGT GGT CAG CCA 336
Arg Leu Val Trp Ala Cys Val Gly Val Glu Val Gly Arg Gly Gln Pro
100 105 110

TTA GGT GTG GGC ATT AGT GGC CAT CCT TTA TTA AAT AAA TTG GAT GAC 384
Leu Gly Val Gly Ile Ser Gly His Pro Leu Leu Asn Lys Leu Asp Asp
115 120 125

ACA GAA AAT GCT AGT GCT TAT GCA GCA AAT GCA GGT GTG GAT AAT AGA 432
Thr Glu Asn Ala Ser Ala Tyr Ala Ala Asn Ala Gly Val Asp Asn Arg
130 135 140

GAA TGT ATA TCT ATG GAT TAC AAA CAA ACA CAA TTG TGT TTA ATT GGT 480
Glu Cys Ile Ser Met Asp Tyr Lys Gln Thr Gln Leu Cys Leu Ile Gly
145 150 155 160

TGC AAA CCA CCT ATA GGG GAA CAC TGG GGC AAA GGA TCC CCA TGT ACC 528
Cys Lys Pro Pro Ile Gly Glu His Trp Gly Lys Gly Ser Pro Cys Thr
165 170 175

AAT GTT GCA GTA AAT CCA GGT GAT TGT CCA CCA TTA GAG TTA ATA AAC 576
Asn Val Ala Val Asn Pro Gly Asp Cys Pro Pro Leu Glu Leu Ile Asn
180 185 190

ACA GTT ATT CAG GAT GGT GAT ATG GTT GAT ACT GGC TTT GGT GCT ATG 624
Thr Val Ile Gln Asp Gly Asp Met Val Asp Thr Gly Phe Gly Ala Met
195 200 205

GAC TTT ACT ACA TTA CAG GCT AAC AAA AGT GAA GTT CCA CTG GAT ATT 672
Asp Phe Thr Thr Leu Gln Ala Asn Lys Ser Glu Val Pro Leu Asp Ile
210 215 220

TGT ACA TCT ATT TGC AAA TAT CCA GAT TAT ATT AAA ATG GTG TCA GAA 720
Cys Thr Ser Ile Cys Lys Tyr Pro Asp Tyr Ile Lys Met Val Ser Glu
225 230 235 240

CCA TAT GGC GAC AGC TTA TTT TTT TAT TTA CGA AGG GAA CAA ATG TTT 768
Pro Tyr Gly Asp Ser Leu Phe Phe Tyr Leu Arg Arg Glu Gln Met Phe
245 250 255

GTT AGA CAT TTA TTT AAT AGG GCT GGT GCT GTT GGT GAA AAT GTA CCA 816
Val Arg His Leu Phe Asn Arg Ala Gly Ala Val Gly Glu Asn Val Pro
260 265 270

GAC GAT TTA TAC ATT AAA GGC TCT GGG TCT ACT GCA AAT TTA GCC AGT 864
Asp Asp Leu Tyr Ile Lys Gly Ser Gly Ser Thr Ala Asn Leu Ala Ser
275 280 285

TCA AAT TAT TTT CCT ACA CCT AGT GGT TCT ATG GTT ACC TCT GAT GCC 912
Ser Asn Tyr Phe Pro Thr Pro Ser Gly Ser Met Val Thr Ser Asp Ala
290 295 300

CAA ATA TTC AAT AAA CCT TAT TGG TTA CAA CGA GCA CAG GGC CAC AAT 960
Gln Ile Phe Asn Lys Pro Tyr Trp Leu Gln Arg Ala Gln Gly His Asn
305 310 315 320

AAT GGC ATT TGT TGG GGT AAC CAA CTA TTT GTT ACT GTT GTT GAT ACT 1008
Asn Gly Ile Cys Trp Gly Asn Gln Leu Phe Val Thr Val Val Asp Thr
325 330 335

ACA CGC AGT ACA AAT ATG TCA TTA TGT GCT GCC ATA TCT ACT TCA GAA 1056
Thr Arg Ser Thr Asn Met Ser Leu Cys Ala Ala Ile Ser Thr Ser Glu
340 345 350

ACT ACA TAT AAA AAT ACT AAC TTT AAG GAG TAC CTA CGA CAT GGG GAG 1104
Thr Thr Tyr Lys Asn Thr Asn Phe Lys Glu Tyr Leu Arg His Gly Glu
355 360 365

GAA TAT GAT TTA CAG TTT ATT TTT CAA CTG TGC AAA ATA ACC TTA ACT 1152
Glu Tyr Asp Leu Gln Phe Ile Phe Gln Leu Cys Lys Ile Thr Leu Thr
370 375 380

GCA GAC GTT ATG ACA TAC ATA CAT TCT ATG AAT TCC ACT ATT TTG GAG 1200
Ala Asp Val Met Thr Tyr Ile His Ser Met Asn Ser Thr Ile Leu Glu
385 390 395 400

GAC TGG AAT TTT GGT CTA CAA CCT CCC CCA GGA GGC ACA CTA GAA GAT 1248
Asp Trp Asn Phe Gly Leu Gln Pro Pro Pro Gly Gly Thr Leu Glu Asp
405 410 415

ACT TAT AGG TTT GTA ACC TCC CAG GCA ATT GCT TGT CAA AAA CAT ACA 1296
Thr Tyr Arg Phe Val Thr Ser Gln Ala Ile Ala Cys Gln Lys His Thr
420 425 430

CCT CCA GCA CCT AAA GAA GAT CCC CTT AAA AAA TAC ACT TTT TGG GAA 1344
Pro Pro Ala Pro Lys Glu Asp Pro Leu Lys Lys Tyr Thr Phe Trp Glu
435 440 445

GTA AAT TTA AAG GAA AAG TTT TCT GCA GAC CTA GAT CAG TTT CCT TTA 1392
Val Asn Leu Lys Glu Lys Phe Ser Ala Asp Leu Asp Gln Phe Pro Leu
450 455 460

GGA CGC AAA TTT TTA CTA CAA GCA GGA TTG AAG GCC AAA CCA AAA TTT 1440
Gly Arg Lys Phe Leu Leu Gln Ala Gly Leu Lys Ala Lys Pro Lys Phe
465 470 475 480

ACA TTA GGA AAA CGA AAA GCT ACA CCC ACC ACC TCA TCT ACC TCT ACA 1488
Thr Leu Gly Lys Arg Lys Ala Thr Pro Thr Thr Ser Ser Thr Ser Thr
485 490 495

ACT GCT AAA CGC AAA AAA CGT AAG CTG TAA 1518
Thr Ala Lys Arg Lys Lys Arg Lys Leu *
500 505
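
The codon-to-residue pairing shown for SEQ ID NO: 1 can be cross-checked against the standard genetic code. The following Python sketch is illustrative only and is not part of the original listing; the names `translate`, `CODON_TABLE`, and `THREE_LETTER` are assumptions introduced here, and only the first sixteen codons of the sequence are used.

```python
# Illustrative sketch (not from the original listing): translate DNA codons
# with the standard genetic code and print three-letter residue names.
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
    "*": "*",
}


def translate(dna):
    """Translate a DNA string (spaces ignored) into one-letter residues."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First sixteen codons of SEQ ID NO: 1, copied from the listing.
first_row = "ATG TCT CTT TGG CTG CCT AGT GAG GCC ACT GTC TAC TTG CCT CCT GTC"
protein = translate(first_row)
print(" ".join(THREE_LETTER[aa] for aa in protein))
```

Running the same function over any other row of the listing gives a quick way to spot a codon that disagrees with the residue printed beneath it.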

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 506 amino acids 

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Ser Leu Trp Leu Pro Ser Glu Ala Thr Val Tyr Leu Pro Pro Val
1 5 10 15

Pro Val Ser Lys Val Val Ser Thr Asp Glu Tyr Val Ala Arg Thr Asn
20 25 30

Ile Tyr Tyr His Ala Gly Thr Ser Arg Leu Leu Ala Val Gly His Pro
35 40 45

Tyr Phe Pro Ile Lys Lys Pro Asn Asn Asn Lys Ile Leu Val Pro Lys
50 55 60

Val Ser Gly Leu Gln Tyr Arg Val Phe Arg Ile His Leu Pro Asp Pro
65 70 75 80

Asn Lys Phe Gly Phe Pro Asp Thr Ser Phe Tyr Asn Pro Asp Thr Gln
85 90 95

Arg Leu Val Trp Ala Cys Val Gly Val Glu Val Gly Arg Gly Gln Pro
100 105 110

Leu Gly Val Gly Ile Ser Gly His Pro Leu Leu Asn Lys Leu Asp Asp
115 120 125

Thr Glu Asn Ala Ser Ala Tyr Ala Ala Asn Ala Gly Val Asp Asn Arg
130 135 140

Glu Cys Ile Ser Met Asp Tyr Lys Gln Thr Gln Leu Cys Leu Ile Gly
145 150 155 160

Cys Lys Pro Pro Ile Gly Glu His Trp Gly Lys Gly Ser Pro Cys Thr
165 170 175

Asn Val Ala Val Asn Pro Gly Asp Cys Pro Pro Leu Glu Leu Ile Asn
180 185 190

Thr Val Ile Gln Asp Gly Asp Met Val Asp Thr Gly Phe Gly Ala Met
195 200 205

Asp Phe Thr Thr Leu Gln Ala Asn Lys Ser Glu Val Pro Leu Asp Ile
210 215 220

Cys Thr Ser Ile Cys Lys Tyr Pro Asp Tyr Ile Lys Met Val Ser Glu
225 230 235 240

Pro Tyr Gly Asp Ser Leu Phe Phe Tyr Leu Arg Arg Glu Gln Met Phe
245 250 255

Val Arg His Leu Phe Asn Arg Ala Gly Ala Val Gly Glu Asn Val Pro
260 265 270

Asp Asp Leu Tyr Ile Lys Gly Ser Gly Ser Thr Ala Asn Leu Ala Ser
275 280 285

Ser Asn Tyr Phe Pro Thr Pro Ser Gly Ser Met Val Thr Ser Asp Ala
290 295 300

Gln Ile Phe Asn Lys Pro Tyr Trp Leu Gln Arg Ala Gln Gly His Asn
305 310 315 320

Asn Gly Ile Cys Trp Gly Asn Gln Leu Phe Val Thr Val Val Asp Thr
325 330 335

Thr Arg Ser Thr Asn Met Ser Leu Cys Ala Ala Ile Ser Thr Ser Glu
340 345 350

Thr Thr Tyr Lys Asn Thr Asn Phe Lys Glu Tyr Leu Arg His Gly Glu
355 360 365

Glu Tyr Asp Leu Gln Phe Ile Phe Gln Leu Cys Lys Ile Thr Leu Thr
370 375 380

Ala Asp Val Met Thr Tyr Ile His Ser Met Asn Ser Thr Ile Leu Glu
385 390 395 400

Asp Trp Asn Phe Gly Leu Gln Pro Pro Pro Gly Gly Thr Leu Glu Asp
405 410 415

Thr Tyr Arg Phe Val Thr Ser Gln Ala Ile Ala Cys Gln Lys His Thr
420 425 430

Pro Pro Ala Pro Lys Glu Asp Pro Leu Lys Lys Tyr Thr Phe Trp Glu
435 440 445

Val Asn Leu Lys Glu Lys Phe Ser Ala Asp Leu Asp Gln Phe Pro Leu
450 455 460

Gly Arg Lys Phe Leu Leu Gln Ala Gly Leu Lys Ala Lys Pro Lys Phe
465 470 475 480

Thr Leu Gly Lys Arg Lys Ala Thr Pro Thr Thr Ser Ser Thr Ser Thr
485 490 495

Thr Ala Lys Arg Lys Lys Arg Lys Leu
500 505



(2) INFORMATION FOR SEQ ID NO: 3:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 297 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..297

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:

ATG CAT GGA GAT ACA CCT ACA TTG CAT GAA TAT ATG TTA GAT TTG CAA 48
Met His Gly Asp Thr Pro Thr Leu His Glu Tyr Met Leu Asp Leu Gln
1 5 10 15

CCA GAG ACA ACT GAT CTC TAC TGT TAT GAG CAA TTA AAT GAC AGC TCA 96
Pro Glu Thr Thr Asp Leu Tyr Cys Tyr Glu Gln Leu Asn Asp Ser Ser
20 25 30

GAG GAG GAG GAT GAA ATA GAT GGT CCA GCT GGA CAA GCA GAA CCG GAC 144
Glu Glu Glu Asp Glu Ile Asp Gly Pro Ala Gly Gln Ala Glu Pro Asp
35 40 45

AGA GCC CAT TAC AAT ATT GTA ACC TTT TGT TGC AAG TGT GAC TCT ACG 192
Arg Ala His Tyr Asn Ile Val Thr Phe Cys Cys Lys Cys Asp Ser Thr
50 55 60

CTT CGG TTG TGC GTA CAA AGC ACA CAC GTA GAC ATT CGT ACT TTG GAA 240
Leu Arg Leu Cys Val Gln Ser Thr His Val Asp Ile Arg Thr Leu Glu
65 70 75 80

GAC CTG TTA ATG GGC ACA CTA GGA ATT GTG TGC CCC ATC TGT TCT CAG 288
Asp Leu Leu Met Gly Thr Leu Gly Ile Val Cys Pro Ile Cys Ser Gln
85 90 95

AAA CCA TAA 297
Lys Pro *

(2) INFORMATION FOR SEQ ID NO: 4:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 98 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

Met His Gly Asp Thr Pro Thr Leu His Glu Tyr Met Leu Asp Leu Gln
1 5 10 15

Pro Glu Thr Thr Asp Leu Tyr Cys Tyr Glu Gln Leu Asn Asp Ser Ser
20 25 30

Glu Glu Glu Asp Glu Ile Asp Gly Pro Ala Gly Gln Ala Glu Pro Asp
35 40 45

Arg Ala His Tyr Asn Ile Val Thr Phe Cys Cys Lys Cys Asp Ser Thr
50 55 60

Leu Arg Leu Cys Val Gln Ser Thr His Val Asp Ile Arg Thr Leu Glu
65 70 75 80

Asp Leu Leu Met Gly Thr Leu Gly Ile Val Cys Pro Ile Cys Ser Gln
85 90 95

Lys Pro *
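
The declared lengths of the coding sequences and of their translations are related by simple arithmetic, since each listed CDS ends in a single stop codon (shown as `*`). The Python sketch below is illustrative only; the helper name `cds_summary` is an assumption introduced here and is not part of the original listing.

```python
# Illustrative sketch: relate a declared CDS length in base pairs to the
# number of codons and encoded residues, assuming one terminal stop codon.
def cds_summary(base_pairs):
    codons, remainder = divmod(base_pairs, 3)
    if remainder:
        raise ValueError("CDS length is not a whole number of codons")
    # The final codon is the stop codon, so it encodes no residue.
    return {"codons": codons, "residues": codons - 1}


# Declared CDS lengths of SEQ ID NO: 1, 3, and 17 in this listing.
for seq_id, length in [(1, 1518), (3, 297), (17, 294)]:
    print(seq_id, cds_summary(length))
```

For the 297 base pair sequence of SEQ ID NO: 3 this gives 99 codons, matching the 98 residues declared for SEQ ID NO: 4 plus the stop codon.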

(2) INFORMATION FOR SEQ ID NO: 5:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
CCCCGATATC GCCTTTAATG TATAAATCGT CTGG
(2) INFORMATION FOR SEQ ID NO: 6:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
CCCCGATATC TCAAATTATT TTCCTACACC TAGTG 
(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:
AAAGATATCT TGTAGTAAAA ATTTGCGTCC TAAAGGAAAC 
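
The oligonucleotides of SEQ ID NO: 6 and SEQ ID NO: 7 can be checked against their declared lengths and against the coding sequence of SEQ ID NO: 1. The Python sketch below is illustrative only and is not part of the original listing: the helper names (`clean`, `reverse_complement`, `anneals`) and the 20-base tail length are assumptions, and the template is limited to two short stretches of SEQ ID NO: 1 typed in from the listing. Both primers carry the hexamer GATATC, which is the EcoRV recognition sequence; whether it was used as a cloning site is not stated in this section.

```python
# Illustrative sketch (not from the original listing): check declared primer
# lengths and find which strand the 3' end of each primer matches.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def clean(seq):
    """Remove the spaces that group the listing into blocks of bases."""
    return "".join(seq.split()).upper()


def reverse_complement(seq):
    return clean(seq).translate(COMPLEMENT)[::-1]


def anneals(primer, template, tail=20):
    """Return 'forward', 'reverse', or None for the primer's 3' tail."""
    three_prime = clean(primer)[-tail:]
    template = clean(template)
    if three_prime in template:
        return "forward"
    if reverse_complement(three_prime) in template:
        return "reverse"
    return None


# Primers copied from SEQ ID NO: 6 and SEQ ID NO: 7, with declared lengths.
PRIMERS = {
    6: ("CCCCGATATC TCAAATTATT TTCCTACACC TAGTG", 35),
    7: ("AAAGATATCT TGTAGTAAAA ATTTGCGTCC TAAAGGAAAC", 40),
}
# Codons 289-304 and 449-480 of SEQ ID NO: 1, joined by a run of N only to
# keep this example short (the N run is a placeholder, not real sequence).
TEMPLATE = (
    "TCA AAT TAT TTT CCT ACA CCT AGT GGT TCT ATG GTT ACC TCT GAT GCC"
    "NNN"
    "GTA AAT TTA AAG GAA AAG TTT TCT GCA GAC CTA GAT CAG TTT CCT TTA"
    "GGA CGC AAA TTT TTA CTA CAA GCA GGA TTG AAG GCC AAA CCA AAA TTT"
)

for number, (sequence, declared) in PRIMERS.items():
    bases = clean(sequence)
    print(number, len(bases) == declared, "GATATC" in bases,
          anneals(sequence, TEMPLATE))
```

Under these assumptions the 3' end of SEQ ID NO: 6 matches the listed strand directly, while that of SEQ ID NO: 7 matches its reverse complement.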
(2) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:
AAAGATATCT AATCTACCTC TACAACTGCT AAACGCAAAA AACG
(2) INFORMATION FOR SEQ ID NO: 9:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:9: 

AAAAGATATC ATGCATGGAG ATACACCTAC ATTGC 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 34 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
TTTTGATATC GGCTCTGTCC GGTTCTGCTT GTCC 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
TTTTGATATC CTTGCAACAA AAGGTTACAA TATTGTAATG GGCC 
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
AAAAGATATC TGGTTTCTGA GAACAGATGG GGCAC 
(2) INFORMATION FOR SEQ ID NO: 13:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
TTTTGATATC GATTATGAGC AATTAAATGA CAGCTCAG 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14:
TTTTGATATC GTCTACGTGT GTGCTTTGTA CGCAC
(2) INFORMATION FOR SEQ ID NO: 15: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15:
TTTATCGATA TCGGTCCAGC TGGACAAGCA GAACCGGAC 



(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
TTTTGATATC GATGCCCATT ACAATATTGT AACCTTTTG 



(2) INFORMATION FOR SEQ ID NO: 17:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 294 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..294



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

ATG AGT CTT CTA ACC GAG GTC GAA ACG CTT ACC AGA AAC GGA TGG GAG 48
Met Ser Leu Leu Thr Glu Val Glu Thr Leu Thr Arg Asn Gly Trp Glu
1 5 10 15

TGC AAA TGC AGC GAT TCA AGT GAT CCT CTC ATT ATC GCA GCG AGT ATC 96
Cys Lys Cys Ser Asp Ser Ser Asp Pro Leu Ile Ile Ala Ala Ser Ile
20 25 30

ATT GGG ATC TTG CAC TTG ATA TTG TGG ATT TTT TAT CGT CTT TTC TTC 144
Ile Gly Ile Leu His Leu Ile Leu Trp Ile Phe Tyr Arg Leu Phe Phe
35 40 45

AAA TGC ATT TAT CGT CGC CTT AAA TAC GGT TTG AAA AGA GGG CCT TCT 192
Lys Cys Ile Tyr Arg Arg Leu Lys Tyr Gly Leu Lys Arg Gly Pro Ser
50 55 60

ACN GAA GGA GCG CCT GAG TCT ATG AGG GAA GAA TAT CGG CAG GAA CAG 240
Thr Glu Gly Ala Pro Glu Ser Met Arg Glu Glu Tyr Arg Gln Glu Gln
65 70 75 80

CAG AGT GCT GTG GAT GTT GAC GAT GTT CAT TTT GTC AAC ATA GAG CTG 288
Gln Ser Ala Val Asp Val Asp Asp Val His Phe Val Asn Ile Glu Leu
85 90 95

GAG TAA 294
Glu *

(2) INFORMATION FOR SEQ ID NO: 18:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 97 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18:

Met Ser Leu Leu Thr Glu Val Glu Thr Leu Thr Arg Asn Gly Trp Glu
1 5 10 15

Cys Lys Cys Ser Asp Ser Ser Asp Pro Leu Ile Ile Ala Ala Ser Ile
20 25 30

Ile Gly Ile Leu His Leu Ile Leu Trp Ile Phe Tyr Arg Leu Phe Phe
35 40 45

Lys Cys Ile Tyr Arg Arg Leu Lys Tyr Gly Leu Lys Arg Gly Pro Ser
50 55 60

Thr Glu Gly Ala Pro Glu Ser Met Arg Glu Glu Tyr Arg Gln Glu Gln
65 70 75 80

Gln Ser Ala Val Asp Val Asp Asp Val His Phe Val Asn Ile Glu Leu
85 90 95

Glu *

(2) INFORMATION FOR SEQ ID NO: 19:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
TTTTGATATC GATATGGAAT GGCTAAAGAC AAGACCAATC 
(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
TTTTGATATC GTTGTTTGGA TCCCCATTCC CATTG 
(2) INFORMATION FOR SEQ ID NO: 21:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
GTTATGACAT ACATACATTC TATG 
(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22:
CCATGCATTC CTGCTTGTAG TAAAAATTTG CGTCC 
(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 
CTACAAGCAG GAATGCATGG AGATACACC 
(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 
CATCTGAAGC TTAGTAATGG GCTCTGTCCG GTTCTG 
(2) INFORMATION FOR SEQ ID NO: 25:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
CATCTGAAGC TTATCAATAT TGTAATGGGC TCTGTCCG 
(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
CATCTGAAGC TTACTTGCAA CAAAAGGTTA CAATATTGTA ATGGGCTCTG TCCG 
(2) INFORMATION FOR SEQ ID NO:27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 69 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
CATCTGAAGC TTAAAGCGTA GAGTCACACT TGCAACAAAA GGTTACAATA TTGTAATGGG 
CTCTGTCCG 

(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:28: 
CATCTGAAGC TTATTGTACG CACAACCGAA GCGTAGAGTC ACACTTG 

WHAT IS CLAIMED IS:

1. A vaccine formulation comprising a human papilloma
virus capsomere, said capsomere comprising a fusion protein comprising a
human papilloma virus L1 protein adjacent amino acid residues from a
second protein.

2. A vaccine formulation comprising a human papilloma
virus capsomere, said capsomere comprising a truncated human papilloma
virus L1 protein having a deletion of one or more amino acid residues
necessary for formation of a virus-like particle.

3. The vaccine formulation of claim 2 wherein said
capsomere comprises a fusion protein comprising a truncated human
papilloma virus L1 protein adjacent amino acid residues from a second
protein.

4. The vaccine formulation of any one of claims 1, 2, or
3 wherein the L1 protein is encoded in the genome of a human papilloma
virus selected from the group consisting of HPV6, HPV11, HPV16,
HPV18, HPV33, HPV35, and HPV45.

5. The vaccine formulation of claim 4 wherein the
papilloma virus is HPV16.

6. The vaccine formulation of any one of claims 2, 3, or
5 wherein carboxy terminal amino acid residues are deleted from the L1
protein.

7. The vaccine formulation of claim 6 wherein 1 to 34
carboxy terminal amino acid residues are deleted from the L1 protein.

8. The vaccine formulation of claim 7 wherein 34
carboxy terminal amino acid residues are deleted from the L1 protein.

9. The vaccine formulation of any one of claims 2, 3, or
5 wherein amino terminal amino acid residues are deleted from the L1
protein.

10. The vaccine formulation of any one of claims 2, 3, or
5 wherein internal amino acid residues are deleted from the L1 protein.

11. The vaccine formulation of claim 10 wherein the
amino acid residues deleted from the L1 protein comprise a nuclear
localization signal.

12. The vaccine formulation of claims 2 or 3 wherein the
amino acid residues from the second protein are derived from an HPV
protein.

13. The vaccine formulation of claim 12 wherein the
HPV protein is an early HPV protein.

14. The vaccine formulation of claim 12 wherein the
early HPV protein is selected from the group consisting of E1, E2, E3, E4,
E5, E6, and E7.

15. A method of treating an individual infected with an
HPV virus comprising the step of administering to a patient in need thereof
an amount of the vaccine formulation of claims 1, 2, 3, 5, 7, 8, 11, 13 or
14 effective to reduce the level of HPV infection.

16. A method for preventing papilloma virus infection
comprising the step of administering to an individual susceptible thereto an
amount of the vaccine formulation of claims 1, 2, 3, 5, 7, 8, 11, 13 or 14
effective to inhibit HPV infection.